OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Character Extraction from Interfering Background - Analysis of Double-sided Handwritten Archival Documents

Internal identifier: 001B58 (Main/Exploration); previous: 001B57; next: 001B59

Authors: Lim Tan [Singapore]; Ruini Cao [Singapore]; Qian Wang [Singapore]; Peiyi Shen [Singapore]

RBID : ISTEX:7372B017B35B17D6C4DB67444EBEF0439147E5B2

Abstract

The seeping of ink through the pages of certain double-sided handwritten documents after long periods of storage poses a serious problem for human readers and OCR systems. This paper addresses the problem by recovering the content on the front side of a page from the interfering image caused by the handwriting on the reverse side. First, by adapting the Gaussian stochastic model, an interference model based on norm-orientation discontinuity is proposed for analyzing the properties of the interfering strokes. Second, an improved Canny edge detector with an edge norm-orientation similarity constraint is applied. In addition, two low thresholds are used to detect edges instead of a single low threshold. This improvement can link weaker foreground edges without introducing noise in the overlapping/overshadowed area. The proposed algorithms perform well regardless of the intensity differences between the image on the front side and the interfering image from the reverse side. Segmentation results on real images are shown and evaluated.
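
The abstract does not say how the two low thresholds and the orientation constraint are combined, so the following is only a minimal sketch of one plausible reading, not the authors' algorithm. It assumes that a neighbour of an already accepted edge pixel is linked either when its gradient magnitude passes the stricter low threshold, or when it passes the looser one and its gradient orientation is close to that of the pixel it is linked from. The names `link_edges`, `angle_diff`, `low_strict`, `low_loose` and `max_angle` are hypothetical and chosen for illustration.

```python
import math
from collections import deque


def angle_diff(a, b):
    """Smallest absolute difference between two angles, in radians."""
    d = abs(a - b) % (2 * math.pi)
    return min(d, 2 * math.pi - d)


def link_edges(mag, ori, high, low_strict, low_loose, max_angle):
    """Hysteresis-style edge linking with two low thresholds (assumed rule).

    mag, ori: 2D lists of gradient magnitude and orientation (radians).
    Pixels with mag >= high seed the edge map. An 8-connected neighbour is
    added if mag >= low_strict, or if mag >= low_loose and its orientation
    differs from the linking pixel's by at most max_angle.
    """
    rows, cols = len(mag), len(mag[0])
    edge = [[mag[r][c] >= high for c in range(cols)] for r in range(rows)]
    queue = deque((r, c) for r in range(rows) for c in range(cols) if edge[r][c])
    while queue:
        r, c = queue.popleft()
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                nr, nc = r + dr, c + dc
                if dr == 0 and dc == 0:
                    continue
                if not (0 <= nr < rows and 0 <= nc < cols):
                    continue
                if edge[nr][nc]:
                    continue
                m = mag[nr][nc]
                similar = angle_diff(ori[r][c], ori[nr][nc]) <= max_angle
                if m >= low_strict or (m >= low_loose and similar):
                    edge[nr][nc] = True
                    queue.append((nr, nc))
    return edge


# Toy one-row image: a strong seed, a medium pixel, then two weak pixels,
# the last of which has a very different gradient orientation.
mag = [[10, 6, 3, 3]]
ori = [[0.0, 0.0, 0.1, 1.5]]
edges = link_edges(mag, ori, high=8, low_strict=5, low_loose=2, max_angle=0.3)
```

In this toy input the third pixel is weak but has nearly the same orientation as its neighbour, so it is linked; the fourth is equally weak but oriented very differently, which under the assumed rule is treated as likely interference and left out.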

DOI: 10.1007/3-540-44732-6_10


The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Character Extraction from Interfering Background - Analysis of Double-sided Handwritten Archival Documents</title>
<author>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
</author>
<author>
<name sortKey="Cao, Ruini" sort="Cao, Ruini" uniqKey="Cao R" first="Ruini" last="Cao">Ruini Cao</name>
</author>
<author>
<name sortKey="Wang, Qian" sort="Wang, Qian" uniqKey="Wang Q" first="Qian" last="Wang">Qian Wang</name>
</author>
<author>
<name sortKey="Shen, Peiyi" sort="Shen, Peiyi" uniqKey="Shen P" first="Peiyi" last="Shen">Peiyi Shen</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:7372B017B35B17D6C4DB67444EBEF0439147E5B2</idno>
<date when="2001" year="2001">2001</date>
<idno type="doi">10.1007/3-540-44732-6_10</idno>
<idno type="url">https://api.istex.fr/document/7372B017B35B17D6C4DB67444EBEF0439147E5B2/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000C95</idno>
<idno type="wicri:Area/Istex/Curation">000C72</idno>
<idno type="wicri:Area/Istex/Checkpoint">001212</idno>
<idno type="wicri:doubleKey">0302-9743:2001:Tan L:character:extraction:from</idno>
<idno type="wicri:Area/Main/Merge">001C51</idno>
<idno type="wicri:Area/Main/Curation">001B58</idno>
<idno type="wicri:Area/Main/Exploration">001B58</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Character Extraction from Interfering Background - Analysis of Double-sided Handwritten Archival Documents</title>
<author>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 119260, Lower Kent Ridge Crescent</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
<author>
<name sortKey="Cao, Ruini" sort="Cao, Ruini" uniqKey="Cao R" first="Ruini" last="Cao">Ruini Cao</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 119260, Lower Kent Ridge Crescent</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
<author>
<name sortKey="Wang, Qian" sort="Wang, Qian" uniqKey="Wang Q" first="Qian" last="Wang">Qian Wang</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 119260, Lower Kent Ridge Crescent</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
<author>
<name sortKey="Shen, Peiyi" sort="Shen, Peiyi" uniqKey="Shen P" first="Peiyi" last="Shen">Peiyi Shen</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 119260, Lower Kent Ridge Crescent</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2001</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">7372B017B35B17D6C4DB67444EBEF0439147E5B2</idno>
<idno type="DOI">10.1007/3-540-44732-6_10</idno>
<idno type="ChapterID">10</idno>
<idno type="ChapterID">Chap10</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: The seeping of ink through the pages of certain double-sided handwritten documents after long periods of storage poses a serious problem for human readers and OCR systems. This paper addresses the problem by recovering the content on the front side of a page from the interfering image caused by the handwriting on the reverse side. First, by adapting the Gaussian stochastic model, an interference model based on norm-orientation discontinuity is proposed for analyzing the properties of the interfering strokes. Second, an improved Canny edge detector with an edge norm-orientation similarity constraint is applied. In addition, two low thresholds are used to detect edges instead of a single low threshold. This improvement can link weaker foreground edges without introducing noise in the overlapping/overshadowed area. The proposed algorithms perform well regardless of the intensity differences between the image on the front side and the interfering image from the reverse side. Segmentation results on real images are shown and evaluated.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Singapour</li>
</country>
<orgName>
<li>Université nationale de Singapour</li>
</orgName>
</list>
<tree>
<country name="Singapour">
<noRegion>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
</noRegion>
<name sortKey="Cao, Ruini" sort="Cao, Ruini" uniqKey="Cao R" first="Ruini" last="Cao">Ruini Cao</name>
<name sortKey="Cao, Ruini" sort="Cao, Ruini" uniqKey="Cao R" first="Ruini" last="Cao">Ruini Cao</name>
<name sortKey="Shen, Peiyi" sort="Shen, Peiyi" uniqKey="Shen P" first="Peiyi" last="Shen">Peiyi Shen</name>
<name sortKey="Shen, Peiyi" sort="Shen, Peiyi" uniqKey="Shen P" first="Peiyi" last="Shen">Peiyi Shen</name>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<name sortKey="Wang, Qian" sort="Wang, Qian" uniqKey="Wang Q" first="Qian" last="Wang">Qian Wang</name>
<name sortKey="Wang, Qian" sort="Wang, Qian" uniqKey="Wang Q" first="Qian" last="Wang">Qian Wang</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001B58 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001B58 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:7372B017B35B17D6C4DB67444EBEF0439147E5B2
   |texte=   Character Extraction from Interfering Background - Analysis of Double-sided Handwritten Archival Documents
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024